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ABSTRACT 


Classification and identification of synthetic flavor become routine activities 
in the flavor and food industry due to its application. As a modern olfactory 
technology, electronic nose (e-nose) has the possibility to be applied in these 
activities. This study aimed to evaluate an e-nose for classifying synthetic 
flavors. In this study, an e-nose was designed with an array of gases sensors 
as the main sensing component and principal component analysis (PCA) for 
the pattern recognition software. This research was started with preparation 
of the hardware, continued with preparation of sample, data collection, and 
analysis. There were nine samples of synthetic flavors with different aroma, 
namely: grapes, strawberry, mocha, pandanus, mango, jackfruit, orange, 
melon, and durian. The data collection process includes three stages, 1.e. 
flushing, collecting, and purging of 2 min, 3 min, 2 min respectively. These 
sensor responses were then analyzed for forming aroma patterns. Four pre- 
treatment methods were applied for the aroma pattern formation: absolute 
data, normalize of absolute data, relative data, and normalize of relative data. 
With the PCA for evaluation, the results showed that the absolute data 
treatment provided the best results, indicated from the distribution of aroma 
patterns that were grouped according to the type of samples. 
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1. INTRODUCTION 


In the food processing, a number of flavors can be formed or can be degraded [1]. On the one hand, 
flavor is the main quality attribute that determines the level of acceptance of a food product by consumers. 
Therefore, the existence of flavors in accordance with consumer tastes is very important so that a food 
product can be accepted. Flavor is the sensation that consumers feel when they consume food [1]. It comes 
from the interaction between taste, aroma, and texture. Of course, to get a strong flavor, taste and aroma 
components must be available in food products. Often, these components are available in small quantities 
that are below human detection capability and can be lost during processing or storage. Thus, the flavor of 
food products may not occur or do not give a strong flavor. For this reason, several food industries add a 
number of components of taste and aroma, called flavoring compounds, to create acceptable flavor for their 


food products. 
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Flavor is a food additive that affects the taste and aroma of food or beverages [2]. For aromatization, 
food industries usually use aroma concentrates, essences, extracts and individual compounds [1]. Based on 
the ingredients, flavors are divided into 2 types, namely natural flavors and synthetic flavors [3]. Food 
industries may prefer synthetic flavors over natural flavors because they are easy to be obtained and used [4]. 
So that synthetic flavor producers continue to innovate according to market demand. In terms of maintaining 
the quality of their product, flavor producers should carry out sensory tests. One of the most important 
sensory tests to do in the industry is testing of the smell. The sensory tests are also conducted regularly in 
food industries. Since the aroma becomes the most important quality attributes for food acceptability, the 
testing should be conducted regularly. Usually, industries conduct the sensory testing by applying a group of 
panelists [5]. However, it is not always possible for a panelist to perform consistent analysis, either due to the 
panelist's health condition or the panelist's capacity to manually analyze the aroma of a material. Therefore, 
an analytical instrument that has a function like the human olfactory organ is necessary. For both industries, 
food and synthetic flavor, the tool is specifically expected to be a quality control instrument. One of the 
important devices which is able to identify and classify smell is called an electronic nose (e-nose). 

Electronic nose (e-nose) is an instrument designed to imitate the working principles of the biological 
olfactory system of humans or animals. The e-nose is an instrument that combines a series of gas sensors and 
pattern processing techniques for detecting scents [6]. E-nose is also called olfactory electronics because it 
has the ability to mimic the way the human sense of smell works [7]. E-nose is needed in various processes, 
especially which require the role of sense of smell, including in the food and flavor industry. Several studies 
on electronic nose in detecting and classifying aroma that have been carried out are organic green teas [8], 
black tea classification [9] [10], herbal tea beverage discrimination [11], yogurt flavor assessment [12], 
analysis of ethanol in soy sauce [13], determination of trimethylamine in milk [14], honey nectar detection 
[15], and meat and fish assessment [16]. Referring to the previous research, in this work, the e-nose will be 
implemented to classify synthetic flavors. The focus of this test is to find out whether the electronic nose can 
classify several synthetic flavors using the principal component analysis (PCA) method [11], [14], [17]-[20]. 


2. RESEARCH METHOD 
2.1. Sample 

The sample used in this research are synthetic flavors produced by the same industry (same brand) 
which have different aromas. There were nine different aromas of flavor: grape, melon, jackfruit, pandanus, 
strawberry, durian, orange, mango, and mocha. Each sample of synthetic flavors was prepared as much as 5 
ml using a measuring cup. The sample is then put into the prepared cup and then placed inside the sample 
cup. The sample should remain inside the cup at least 1 minute before data collection begins. 


2.2. Hardware 

The hardware used in this study consisted of an e-nose, sample cup, carrier gas, and a computer. The 
e-nose used in this research was designed with four main parts, namely as a sample handling system, a series 
of gas sensors with suitable conditioning circuit as detector, an interface and controller for data acquisition, 
and computer for data collection and analysis. The part of sample handling was designed by combination of 
two pieces of three-way solenoid valve, sample cup, and carrier gas. The detector consisted of 13 MOS 
sensors. The use of MOS sensors which have a sensitivity to certain gas compounds for detecting part of an 
e-nose was also reported in [15], [17]-[18]. A signal conditioning circuit with a voltage divider was also 
applied in the e-nose. As also applied in [7], the e-nose also completed with a temperature-humidity sensor 
(DHT22). A microcontroller of Arduino Mega was applied for analog to digital converter (ADC), interface, 
and controller. The e-nose was developed in the Laboratory of Agricultural Energy & Machinery, Faculty of 
Agricultural Technology, Universitas Gadjah Mada. The e-nose system aims to obtain efficient gas sensor 
response so that the flow of sample and carrier gases are regulated. The working principle of the e-nose can 
be seen in Figure 1, which starts from the pressurized gas entering valve 1 then flows to valve 2 which can go 
through part | or part 2, then flows to valve 2, temperature and humidity sensors, gas sensors, and finally will 
be forwarded to the computer. 


2.3. Software 

The e-nose circuit controlled by the microcontroller is communicated to the computer via a 
universal serial bus (USB) line. To operate the e-nose, it is completed with a graphical user interface (GUI). 
Some menus available on the GUI include start, stop, save, reset, close button. The GUI functions as a 
display program and hardware controller in measuring samples, the data displayed is the response of the gas 
sensor, the response of the temperature and humidity sensors. Sample measurement consisted of one sniffing 
cycle, namely flushing, collecting, and purging as shown in Figure 2, this process has also been cried out in 
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[21], [22]. Flushing is the process of reading the sensor response without a test sample. The result of this 
process is a reference value for sensor readings when exposed to clean air. The flushing part is done for 2 
minutes. Collecting is the process of flowing the sample aroma into the sensor room so that the sensor 
response is exposed to the smell of the test sample, this process lasts 3 minutes. Purging is the process of re- 
cleaning the sensor chamber from the smell of the previous sample. This process lasts for 2 minutes. The 
result of this process is a decrease in the signal caused by the smell of the sample flowing into the sample 
chamber being cut off or closed by a valve. 
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Figure 1. Experimental setup of e-nose system, (a) Schematic design, (b) Real hardware 
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Figure 2. Process of sampling 


2.4. Analysis procedure 

The sensor response data from the sample exposure is converted to voltage using signal conditioners 
[1]. Furthermore, the data will be displayed on the computer in the form of an excel file. The data will be 
analyzed so that the pattern results of each synthetic flavor are obtained. The data analysis steps are explained 
as follow: 
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2.4.1. Pre-treatment data processing 

The data that has been stored in the form of an excel file is pre-treated with data processing which 
consists of 4 management methods, namely (a) the absolute data method, (b) normalize absolute data, (c) 
relative data, and (d) normalize relative data. Normalization baseline data have also been carried out by Liu 
et al. [23]. Prior to pre-treatment, it must be preceded by determining the constant range of data from the 
flushing and collecting processes. Absolute data is constant data obtained from the collecting process, 
normalize absolute data is constant data divided by the maximum value of the data, relative data is constant 
data from the collecting process minus constant data from the flushing process, and normalize relative data is 
the relative value of data divided by the maximum value of the data. The maximum value that is processed 
using the normalize method is 1. The purpose of this pretreatment process is to obtain the difference in 
pattern recognition between synthetic flavor samples. 


2.4.2. Principal component analysis 

Principal component analysis (PCA) is a method of data transformation with many variables that 
may interact with each other into new data with the same number of variables, but the new variables are not 
correlated. In this research, PCA based on covariance matrix is applied. This method was also applied in [7], 
includes a series step. The analysis is started by calculating the covariance matrix from the aroma patterns of 
samples, extracting the eigenvalue and eigenvector, then transforming the original data into the new data by 
using selected eigenvector. Each new data variable contains a number of variances which are important 
information from the data. This new variable is commonly called principal component (PC). In PCA, each 
PC is sorted based on the amount of the carried information, where PC-1 contains the largest variant, 
followed by PC-2, then PC-3, and so on to PC-n. Generally, the use of some early PCs has brought most of 
the information from the processed data so that it is possible to be visualized. The data obtained from the pre- 
treatment of data processing then analyzed using the PCA program. The results obtained from processing 
using the PCA method are graphs showing the relationship between the first component (x-axis) or called 
PC-1 and the second component (y-axis) or called PC-2. The PCA analysis graph will show the plot score of 
nine samples. Each point on the PCA analysis graph represents one replication per sample so that 1 test 
sample will produce 30 points (data) according to the test data replication. 


3. RESULTS AND DISCUSSION 
3.1. Gas sensor response 

Data collection begins with sample preparation. The sample that has been prepared with the same 
volume is then put in a closed sample container. E-nose is prepared for at least 2 minutes before use. By 
pressing the start button on the GUI, the data retrieval process starts. The data collection process for each 
sample was carried out for 7 minutes, which included flushing for 2 min, collecting for 3 min, and purging 
for 2 min. Data reading was carried out with a period of a second, so that in each sample there were 420 data 
from each sensor used. The data is represented in the form of voltage values. In this design, the e-nose 
contains 13 MOS sensors. Data of the sensor response to the presented sample is displayed in the GUI. 
Testing of e-nose without samples with the same method is carried out to determine the comparison of sensor 
responses before and after being presented the sample. The sensor response with the presentation of the 
sample can be seen in Figure 3. 
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Figure 3. Response of gas sensor with the presentation of sample 


In addition to the gas sensor array, the e-nose circuit used is also equipped with a temperature and 
relative humidity (RH) sensor. The temperature and RH data readings by the sensor are also displayed in the 
GUI. Sample of it can be seen in Figure 4. The change in humidity response in the collecting process shows 
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that the sample being presented may contain water. D. Lelono, K. Triyana, S. Hartati, and J. Istiyanto [24] the 
greater the sample room temperature in the data collection process, the greater the intensity of the sensor 
response. The temperature sensor response is consistent enough during the data collection process so that the 
temperature does not significantly affect the change in gas response. 
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Figure 4. Temperature and humidity of sample during sampling 


3.2. Results of principal component analysis 

In this study, PCA was used to map the aroma pattern data of all analyzed samples. This study 
analyzed nine samples of synthetic flavor available in the market. These include grape, melon, jackfruit, 
pandanus, strawberry, durian, orange, mango, and mocha flavors. By using the covariance matrix base, PCA 
generates a new data matrix which is transformed from the eigenvector values obtained. In this analysis, two 
PCs are used so that new data can be visualized in 2-dimensional graphical form. By applying PC-1 on the x- 
axis and PC-2 on the y-axis, aroma pattern data from the PCA results can be displayed. Figure 5 presents the 
distribution of aroma patterns obtained by the four pretreatment methods used: (a) absolute data, (b) 
normalized absolute data, (c) relative data, (d) normalized relative data. 

Based on the distribution of the aroma patterns of the analyzed samples, it can be seen that 
Figure 5(a) shows the best results and shows the distribution of clustered aroma patterns for data from the 
same sample and each sample tends to separate from one another. Based on this graph, this analysis shows 
that there are at least seven samples from nine samples that can be grouped. While the PCA for aroma pattern 
data analyzed with pretreatment absolute data were normalized as shown in Figure 5(b), the aroma patterns 
for each sample tended to blend with one another, making it difficult to identify. Meanwhile, the results of 
data analysis with pretreatment were relatively poor in terms of pattern grouping. The results of this analysis 
only show a few classifiable patterns (about three types of synthetic flavors). Likewise, for the analysis of the 
aroma pattern data formed by the pretreatment method of the normalized relative data. The analysis results 
show the distribution of aroma patterns that cannot be grouped. Based on these results, the pretreatment of 
absolute data method can be selected for the purpose of classification or identification of synthetic flavors by 
using an e-nose combined with PCA. 

Figure 5(a) shows that the samples are grouped and easily separated. Although there were some sample 
points that were mixed with other samples, most of the samples were able to be classified. Based on this graph, the 
distribution of aroma patterns presented in Figure 5(a) has presented 84% of the information from the data (60.9% 
from PC-1 and 23.1% from PC-2). The information value of 84% can certainly represent research data. Of course, the 
value of the information represented by this new data can be increased by adding another PC, for example PC-3. The 
addition of PC-3 can increase the amount of variance information of the initial data generated, which is possible to 
produce a better grouping of data. However, the use of 3 PCs in this analysis has not been carried out. Forward 
analysis with 3 PCs is possible to follow up this study. 

The results of the distribution of aroma patterns presented in Figure 5(a) show that there are a 
number of points that are close to each other even though these points come from different types of samples. 
The overlapping score plots may be caused by the similar content of compounds in the sample. An example 
is between the melon and strawberry flavor samples. I. Gonda et al. [25], melons and strawberries contain the 
same ester compounds, namely methyl acetate and ethyl acetate. This is also reinforced from other 
references, melons and strawberries have the same synthetic ester, namely ethyl 2 methylbutyrate. Other test 
samples that have overlapping points are strawberry and jackfruit flavor. B. T. Ong et al. [26], both samples 
have the same ester content, namely butyl acetate. Thus, most likely the cause of point crush between 
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samples is because the test sample contains the same ester compound. The results of PCA analysis on nine 
samples showed that the electronic nose was able to distinguish the synthetic taste that was seen from several 
samples that could be grouped well [27]. 
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Figure 5. Score plot of PCA with four different pretreatments, (a) Absolute data, (b) Normalized absolute 
data, (c) Relative data, (d) Normalized relative data 


4. CONCLUSION 

Study on the implementation of e-nose based on a gas sensor array for classification of synthetic 
flavors has been conducted. These MOS sensors provide a good response to the sample presentation 
indicated from the difference between sensor response and the baseline. Classification analysis using the 
PCA method, pre-treatment with absolute data analysis resulted in the best results. Based on PCA, by using 
two main components (PC-1 and PC-2), the distribution of aroma patterns formed by this treatment showed 
that at least seven samples can be calcified properly. The synthetic flavors classified well were the flavors for 
grape, melon, pandanus, strawberry, orange, mango, and mocha, while the samples of jackfruit and durian 
flavors tended to mix with other samples. From the result, the aroma pattern of strawberry and jackfruit 
tended to be mixed. 
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